DOCUMENT RESUME



BD 103 M6 

TM 004 277

AUTHOR        Mann, Peter B., Ed.
TITLE         Assessment in Schools, Colleges and Society.
INSTITUTION   Educational Testing Service, Princeton, N.J. Center
              for Statewide Educational Assessment.
PUB DATE      Nov 73
NOTE          48p.

EDRS PRICE    MF-$0.76 HC-$1.95 PLUS POSTAGE
DESCRIPTORS   Colleges; *Educational Assessment; Educational
              Testing; *Evaluation; *Evaluation Needs; Higher
              Education; *Mass Media; Occupational Choice; Program
              Evaluation; Testing
IDENTIFIERS   *Needs Assessment; Social Implications

ABSTRACT
Testing, assessment, and evaluation, all based on
measurement, are assuming increasingly important roles in American
society. They are becoming a more vital force in the world of work,
in education, and in the processes of determining the effectiveness
of social institutions. Educational Testing Service (ETS) convened a
conference for education editors and writers on "Assessment in
Schools, Colleges and Society" because newspapers, magazines, and
journals, after all, are the primary conduits for information flowing
from specialists to the various groups affected by their work. But
communication with the general public, educators, and business and
government leaders is essential for specialists in measurement. The
success of their work depends to an extraordinary degree on the
understanding of both those whose abilities and achievements are
measured and those who use the results. Some topics of discussion
were evaluating educational programs, forecasting assessment needs in
higher education, assessment for career decisions, and roles and
implications of assessment for society. (Author/DEP)



BEST COPY AVAILABLE




SCHOOLS, COLLEGES & SOCIETY




report of a conference

for 

education editors and writers 









Edited by Peter B. Mann




November 1973






Center 


Educational Testing Service


Princeton, New Jersey 08540






Educational Testing Service is an Equal Opportunity Employer.
Copyright 1974 by Educational Testing Service. All rights reserved.






CONTENTS 



Introduction 

Measurement and Its Uses
William W. Turnbull

Where Are We Going?
Samuel Ball

Evaluating Educational Programs
Samuel Messick

Statesmen, Scientists, and Soothsayers:
Forecasting Assessment Needs in Higher Education

Assessment for Career Decisions and Roles
Winton H. Manning

Implications of Assessment for Society
Melvin M. Tumin

A Look at Education Reporting
Fred M. Hechinger

Some Points of Interest

Roster of Participants






Introduction 



Testing, assessment, and evaluation, all based on measurement, are assuming
increasingly important roles in American society. They are becoming a more vital
force in the world of work, in education, and in the processes of determining the
effectiveness of social institutions. Educational Testing Service is an organization
devoted to the responsible development and use of measurement technology and
techniques. In this time of change, then, it was appropriate for ETS to convene a
conference for education editors and writers on Assessment in Schools, Colleges,
and Society.

Newspapers, magazines, and journals, after all, are the primary conduits for
information flowing from specialists to the various groups affected by their work.
That is true of specialists of any kind, but communication with the general public,
educators, and business and government leaders is essential for specialists in
measurement. The success of their work depends to an extraordinary degree on the
understanding of both those whose abilities and achievements are measured and
those who use the results. And the stakes are often high: college admission or
employment for the individual, gain or loss of public confidence for the institution.

The 1973 conference was enlightening. ETS staff members provided the journalists
with up-to-date information on the status of, and prospects for, measurement;
the journalists gave staff members fresh insights into the layman's view of their
field. The conference was also enjoyable. Its setting was the new ETS Henry
Chauncey Conference Center in Princeton, and the atmosphere was pleasantly
informal. For staff members, there was the added satisfaction that the conference
was one of the several events marking the 25th anniversary of ETS and that the
focus was not on the successes of the past but the challenges of the future.

This report is an effort to record the essence of the conference, to provide not
only easy recall for the participants but information for those who could not
attend. There is no attempt here to capture all that was said. The papers presented
by the seven main speakers, reproduced in full, are followed by a brief summary of
the major concerns, Some Points of Interest, that emerged from the discussions.

Unfortunately, I was unable to attend the conference because of a throat
infection. My paper, Measurement and Its Uses, was given, and improved, by
Scarvia Anderson, an ETS vice president and director of the Atlanta Office and
co-chairman of the conference with Jane D. Wirsig, executive director of
Information Services and Publications.

When she finished reading my paper, Miss Anderson noted that it contained a
definition of measurement and said this word and three others (testing, assessment,
and evaluation) have specialized meanings and would be used often during the
conference. She proceeded to offer quick definitions, and at the next session they
were amplified by another ETS vice president, Samuel Messick. The distinctions
between these related terms are certainly as important to the reader as they were to
the conference participants. So here is an amalgam of the definitions:

Measurement is systematic observation of behavior under specified circumstances,
leading to results that are expressed in meaningful terms, usually quantitative, and
that can be reproduced at different times and places. Measurements can be taken in
many ways: tests, questionnaires, interviews, observations, ratings, instruments
that record physiological responses, and so on.

Testing is one kind of measurement in which specific instruments (often of the
paper-and-pencil variety) are used to gauge an individual's ability, achievement,
interests, personality traits, or other characteristics. Usually, a test focuses on a
single variable, but several variables can be measured simultaneously by
administering a battery of tests or an omnibus test involving subscores. In the
aggregate, test results can be used to characterize groups, such as classrooms or
schools.

Assessment is a more comprehensive process. Frequently, several tests are
involved as well as questionnaires, interviews, and observations, in order to obtain a
broad picture of a person, program, or situation.

Evaluation adds to assessment the element of judgment. After the assessment
results are in, the evaluators apply standards of worth or acceptability to them: for
example, do the results indicate that this program is effective in accomplishing its
purpose? Tests and other measures, while they may provide the basis for evaluation,
are not in themselves evaluations.

It is hoped that the reader, armed with this knowledge, will proceed with relative
ease through these papers. If so, the reward should be a broader view and a deeper
understanding of measurement in its multiple manifestations.



William W. Turnbull
President, ETS



Measurement and Its Uses



By William W. Turnbull 

In any discussion of testing, or measurement, or assessment, one of the problems is
that those terms conjure up quite different things to different people.

It is as if someone said "let's discuss means of transportation," and one person
thought about space craft, another about airplanes, another about trains, and so on
through bicycles and pogo sticks. There are probably some generalizations to be
made about all these forms of transportation for a particular purpose.

At a high level of generality, the conversation may go along quite well. Suppose
one man thinks train while another thinks pogo stick. They may agree that it beats
walking, that fatigue sets in after a while, that the metal ones are much better than
the wooden ones, or even that it's great to hop on one, but fierce and futile
arguments may break out when one party says they should be nationalized.

I'd like you to visualize several different kinds of situations. Last Saturday in
Omaha, Nebraska, more than 200 high school students took the Scholastic
Aptitude Test. By definition, that's a test. Last week, in our Infant Laboratory,
Michael Lewis patiently showed a four-month-old child a series of colored slides
while recording the infant's pulse, respiration, and EEG. Although the child didn't
know it, her degree of attentiveness (or boredom) was being measured and related
to the kind of picture being shown.

In California last spring, more than 100 colleges made use of something called the
Institutional Goals Inventory, a questionnaire designed to elicit the aspirations of
students, faculty, administrators, and trustees for the same institution. Here, the
object of scrutiny is an institution rather than a person. Just the same, the process
is broadly one of measurement.

And so it goes. Measurement encompasses the very young and the very old;
traditional study, as in a classroom test of spelling, and nontraditional, as in
assessment of the educational value of a young person's Peace Corps experience. It
refers to attributes both intellectual and temperamental. It takes different forms
and conditions, depending on whether it's for guidance, evaluation, amusement,
self-understanding, selection, placement, or credit, and on whether an individual is
to be measured or a group of people, an institution, a school system, a course of
study. I always liked Scarvia Anderson's paper on how to measure the drawing
power of a museum display. Its title is "Noseprints on the Glass."

You may be getting the idea that I include any observation at all under the
measurement rubric. Not quite. Measurement must be systematic enough to possess
several attributes. The circumstances under which the behavior or events occurred
must be specified, the characteristics under observation need to be identified, the
results must be expressed in meaningful terms (whether quantitative or not), and all
the procedures must be described and followed with enough precision so that you
or someone else could repeat the whole process and come up with reasonably
similar results.

Much of the research on measurement itself has had to do with how to achieve
those characteristics using a wide array of techniques in a great variety of situations.
The academic disciplines surrounding measurement have grown up around just such
questions.




Through thousands of studies, the answers have begun to come in. The results
point clearly to the measurability of a very wide spectrum of human abilities,
interests, values, and attainments, although the measurement techniques that work
best vary depending on what you want to measure. (Of course, no sensible person
would have thought otherwise.)

The kinds of tests that have been most prominent in popular discussions over the
past couple of decades have been intelligence tests and the scholastic aptitude and
achievement tests used for college entrance.

The IQ has worked a great deal of mischief in education and society. The reasons
are several but stem mainly from the unfortunate and quite erroneous
interpretations that have come to be associated with the term. The IQ is assumed to
be an innate characteristic, which is certainly false in considerable measure, although
nobody knows how much. It is believed to be unchanging through the person's life,
which it is not. It is looked upon as a global, unitary trait, which certainly is untrue;
questions on IQ tests usually tap a number of different skills and abilities. IQ score
points have an aura of exactitude they do not deserve: like all other measures in the
human domain, they are approximations, afflicted by random error. Two different
kinds of IQ tests may give markedly different results, quite systematically, over and
above the random error.

For those and other reasons, you may not be too surprised to learn that IQs and
IQ tests are held in rather low regard in ETS quarters. In fact, ETS neither makes
nor gives IQ tests, nor do we report IQ-equivalent scores. If none of you wish to
mention IQs, you will not be disappointing anyone here. We are, of course, glad to
discuss them, but you need have no fear of offending if your attitude is critical.



Measures of particular abilities and attainments, on the other hand, are somewhat
less laden with built-in errors of interpretation, although certainly they are not
immune to them.

Back in the 1920s, Carl Brigham devised something he called a scholastic
aptitude test, a rather general label, but one that he considered much more
specific than the label intelligence test. His purpose was solely to measure the
extent to which a student possessed the abilities needed for successful pursuit of
college-level studies. There was no implication that these abilities were inborn or
immutable. Moreover, the composition of the test was experimental, even though
it was guided by Dr. Brigham's hunches about what college students ought to be
able to do.

He had some good hunches and some bad ones. Among the latter was the guess
that good students would be better able to read people's emotions from their
expressions than would poor students, and so he included a test section in which
the problem was to pick a word that described the emotion being expressed by a
face shown in a line drawing. When he followed up to see whether or not there was
in fact a correlation, he found there was none. The ability to read expressions
apparently had nothing to do with scholastic success.

Among his good guesses was the idea that ability to reason with material
expressed in words, or verbal skill, would be important in college study. It turned
out that verbal skill was probably the single most dependable index of readiness to
pursue a university program successfully. After a few years of further
experimentation and follow-up research, he found that mathematical skills were
similarly useful as an index of probable success in certain kinds of college courses.

Brigham's work was done largely in the 1920s and 1930s. Since that time, we
have explored many varieties of measures for various purposes, but it has proved
exceedingly difficult to find skills that have greater general utility than verbal and
mathematical ability, if your purpose is to predict scholastic success as measured
by college grades.

The statement I just made needs to be examined and tested in several different
contexts, and this has in fact been the subject of a good deal of research. Let me
pursue the logic for a few more moments.

First, it seems to hold true at all score levels. People sometimes say that if a
student has an SAT score of 600, or some other magic number, he has all it
takes, and other qualities, like motivation, will determine his success. I have no
doubt that motivation is indeed exceedingly important, but it appears to operate at
all score levels rather than to determine success only at the high end of the scale. A
student who scores 600 is more likely to do well than one who scores 500. A 700
student is a better bet than a 600. An 800 is a better bet than a 700. The
correlation runs right up the score scale.

Second, the relationship seems to hold up even when the group is heterogeneous
as to such basic characteristics as sex, race, and economic circumstance. That is, by
and large, a rich white student and a poor black student who make the same SAT
scores have an equal probability of succeeding in the same college program. This
conclusion is contrary to popular belief, and so I think it is important to put it on
the table.

Now let me go on quickly to cite some inferences that should not be drawn from
the preeminence of verbal and quantitative abilities in predicting college success.
First, it should not be inferred that SAT scores are perfectly precise. In fact, they
are not: it is most unlikely that an individual tested twice would receive the same
score both times. Second, what I said about the likelihood of success in college
refers to performance averaged over a group, not to individual performance. Any
single person may do much better or much worse than others with similar scores. A
third incorrect inference is that SAT scores measure all the characteristics that are
basic to college success. That certainly is not the case.
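
The first two cautions, that a score is not perfectly precise and that statements
about likelihood describe groups rather than individuals, can be illustrated with a
small simulation. What follows is only a sketch under stated assumptions: each
simulated student is given a hypothetical "true" score, each testing adds random
error, and the result is rounded to the nearest 10 points. The function name
simulate_retest, the error size of 30 points, and the 300-to-700 score range are
all invented for illustration and are not ETS figures.

```python
import random
import statistics


def simulate_retest(n_students=2000, error_sd=30.0, seed=0):
    """Test each simulated student twice and compare the two reported scores.

    All numbers here are hypothetical: error_sd is an assumed size for the
    random error on one testing, not a published figure.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    same_score = 0
    differences = []
    for _ in range(n_students):
        true_score = rng.uniform(300, 700)  # assumed underlying ability
        # Each testing = true score + random error, rounded to the nearest 10.
        first = round((true_score + rng.gauss(0, error_sd)) / 10) * 10
        second = round((true_score + rng.gauss(0, error_sd)) / 10) * 10
        if first == second:
            same_score += 1
        differences.append(second - first)
    share_same = same_score / n_students
    return share_same, statistics.mean(differences), statistics.pstdev(differences)


if __name__ == "__main__":
    share_same, mean_diff, sd_diff = simulate_retest()
    print(f"share with identical scores on both testings: {share_same:.2f}")
    print(f"average change across the group:              {mean_diff:.1f}")
    print(f"spread of individual changes (std. dev.):     {sd_diff:.1f}")
```

Under these assumptions only a small share of students receive the same score
twice, and individual changes are spread widely, yet the average change across the
whole group stays close to zero. That is the sense in which a statement about a
group can be dependable while any single person's score remains an approximation.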

A fourth unjustified conclusion is that SAT scores are perfectly fair to everyone,
since any given score indicates the same probability of success for members of quite
different groups. This means that the test relates closely to the requirements for
success in college as it is presently organized and conducted. If, in college study, the
dice are now loaded against some students and in favor of others, the loading is
probably the same in the SAT. As college study undergoes changes, it may well be
that the measures used to predict success in college, measures that have worked
well, will need to be revised or replaced with other kinds of measures.

I have spent this much time on the SAT not because it has overwhelming
educational importance but because it has been the subject of a great deal of
misinformation and misunderstanding. Over the past 20 years or so, the testing of
individual students for the purpose of selection by an institution has commanded
the greatest amount of public attention and discussion. This has resulted in part
from the fact that the educational establishment was underbuilt, so there was a
shortage of places in relation to the burgeoning demand for further education.
Now the situation is radically different. Supply has caught up with demand.
Although there are still a few schools and colleges that are turning students
away, their number has dwindled rapidly. Recruitment has replaced selection as
the preoccupation of most institutions.



Helping Students Make Choices

Under these circumstances, the kinds of measures that are needed are different, too.
Increasingly, the need is for information that will help students make the choices
that confront them: information about themselves and about the institutions they
might want to attend. For colleges, the new need is for information to help them
place students appropriately and to grant academic credit for learning accomplished
outside the framework of regular courses. How much credit should a student be
given if he or she has picked up the basics of accounting by working in a CPA's
office but without taking a college course? What is the academic value of a year in
the Peace Corps?

If all learning fell naturally into academic disciplines and subdivided neatly into
courses, we would be well along the path to meeting this growing measurement
need. The College-Level Examination Program (CLEP), pioneered by ETS and
sponsored by the College Entrance Examination Board, represents a major
accomplishment: the translation of informal learning into equivalent academic
credits. For someone entering a standard degree program, CLEP clearly provides a
valuable means of obtaining credit for applicable experience.

Amid the current concern about career education, it is interesting to note that the
relationship of education and work moves in two directions, and measurement is
becoming an increasingly important tool in both. I've already pointed out the rising
need for measuring the educational value of work experience. That's one direction.
The other is measuring the on-the-job value of education or training.

How well prepared is the law school graduate to practice law? How competent is
the auto mechanic, whether trained at a community college or a vocational
school, to repair cars? Certainly, measuring competence for professional
certification and occupational licensing is not new, but it is being extended to new
fields. Auto mechanics is one example. Others range from law enforcement to
hospital financial management to real estate. ETS is into these fields plus many
others. One of our latest assignments, by way of illustration, is to develop means of
assessing the proficiency of golf pros for the Professional Golf Association.

This increased emphasis on measuring the results of formal education and training
brings to mind several related points. As I mentioned earlier, testing as a basis for
admission to secondary schools and college is no longer growing. But it is far from
finished. Some institutions will remain selective by choice so long as they remain in
sufficient demand to survive. They are not yet, and may never be, threatened by
higher education's transformation into a buyers' market. Additionally, selection
remains a vital testing function at the graduate and professional level, especially in
those fields where demand is still greater than supply. Law school and medical
school are the readiest examples.

While colleges are seeking valid measuring devices for purposes other than
selection (for guidance, placement, and awarding credit), these needs are also
emerging earlier in the chain of formal education. Means of gauging individual
learning progress are now considered important not only at the secondary and
elementary levels, but in preschools and kindergartens. In fact, increasing attention
is being focused on the assessment of preschool youngsters.

Aside from assessing the progress of students at all levels of education, there is a
strong trend toward the use of testing to evaluate the effectiveness of schools,
school systems, colleges, and programs such as Sesame Street and The Electric
Company. This trend is clearly a response to popular pressure, both for the
improvement of public education and for the increased accountability of the
educational establishment to the taxpayers.

Without lingering too long on the assessment of institutional effectiveness as
distinct from the assessment of individual student progress, I should cite the
interrelatedness of the two kinds of effort. For the individual student, test results
can be helpful if they are used to guide advancement through the learning maze,
tailor programs to needs and interests, and alert teachers to learning gaps. For the
school or college, test results can provide an index of strengths and weaknesses in
the institutional endeavor. If this index is used to improve the enterprise, the
beneficiaries will be the students of the future. My point is that measurement of
both types aims ultimately at improving education for the individual student, either
today or tomorrow.






Where Are We Going?
By Samuel Ball



Let me tell you first of all what I am supposed to be talking about: Educational
Goals, Objectives, and Tests.

I had originally entitled this speech "Hans Christian Anderson, A Visit to the
Steel Works, and Moby Dick: Where Are We Going, Where Do We Think We Are
Going, and How Will We Know When We Get There?" But I was told that was too
long. So I shortened it to "H. C. Anderson and M. Dick," but I was told that was
too cryptic. So we finally settled on "Where Are We Going?" That's a clear
question, but I am not sure the answer is clear.

And now for the story of Hans Christian Anderson, though I doubt whether it is
true. One day Hans was walking along the street when he met the king. "Hello,
where are you going?" said the king in perfect Danish. "I don't know," said Hans
vaguely. The king grew angry because kings like to have straight answers so they can
lower the boom if they have to. "Don't give me that nonsense," said the king.
"Where are you going?" "But I really don't know," replied Hans. Enraged, the king
ordered H. C. Anderson clapped into prison for contempt of court, as it were. The
following week the king visited Hans, still in prison. "Now are you willing to tell me
where you were going?" And Hans replied, "I told the truth, for I did not know
where I was going. How was I to know I was going to gaol?"

In 1955, I taught sixth grade at Clemton Park Public School, having graduated
from teaching third and fourth grades for the three previous years. I had to teach
my sixth-graders about Australian secondary industries, so one of the things I did
was to take them on a trip to the steel works at Wollongong in the Illawarra district
south of Sydney. We left early in the morning and got home pretty late, and we
spent some days beforehand in preparation and afterwards on follow-up activities.
A few weeks later, I had my annual visit by the district inspector of schools.
"Why," he asked, "did you spend all that time on the visit to the steel works?"

"Because I wanted the class to know how steel is made."

"Aha," said he with a malevolent inspectorial glint in his eye. "Don't you realize
that you could have told them how steel is made in much less time than taking
them on this trip?"

"Yes, but they also learned how to get along with each other, and the need to
thank their hosts at the steel works, and the importance of planning trips...."

And then my inspector pointed out that they were not the reasons I had taken
the children to the steel works. "You are shifting your ground," he pointed out
with some logic.

Moby Dick is one of those classics that is frequently ruined in high school. Paul
Diederich of ETS tells the story of the time he was visiting a high school class where
the teacher was very proud of his difficult examinations, which he saw as being
indicative of his high standards. The class had been studying Moby Dick, and Paul
was intrigued at the questions on the class examination. "Name and describe the
various species of whale encountered by Captain Ahab." "Name and give an
account of the ships encountered by Captain Ahab and do so in the order they were
encountered." Well, it could have been worse; on another exam was the question,
"What was the name of Polly's cat in Tom Sawyer?" (And last year my daughter
came home with her social studies test on which was one chestnut, "When did
Columbus discover America?" and one atrocity, "Which troops in the
Revolutionary War were not brave?" The answer, I discovered, was "the British.")



Goals and Objectives 

Let me return later to Hans Christian Anderson, a visit to the steel works, and
Moby Dick, and let me now talk about goals and objectives. I like the convention of
using "goal" to denote the long-term destination we seek to arrive at toward the
end of an educational journey. Goals are important because if we don't have the
long-term destination in mind, the educational processes we design will most
probably become a spaghetti-like, chaotic set of worms.

Of course, it is impossible not to have some goals in mind, but it is easy to keep
them implicit. Once we try to make them explicit, the sins of our thinking are laid
bare. It has been fashionable since John Dewey's Democracy and Education was
published over 55 years ago to talk about our pluralistic society. But one problem
with a pluralistic society is that it enables us to espouse many competing and
contradictory goals all at the same time. And as the Mormons discovered, there are
problems when you espouse too much at one time. [illegible] the days
before Sesame Street went on the air, when the goals of the show were still being
considered. There seemed to be general consensus that the show should teach
preschoolers sharing and cooperation. But a black psychiatrist raised the issue:
should black children learn to be cooperative and to share in general? What about
when they were being put upon? Should they cooperate then? What if they had
been systematically excluded? Should they share at most on a selective
basis: qualified sharing?

Or consider, in real life, the goals of doing one's duty and respect for legitimate
democratic authority. Remember recently General Haig telling Mr. Ruckelshaus,
"Your Commander-in-Chief orders you to do so." And remember the response,
"No," and the subsequent firing? Certainly the majority of America wants to
educate children to respect authority, but it also wants its children to be moral and
have high principles. In schools, teachers want their students to learn what they're
being taught, but they also want their students to be questioning and creative.
Some of us want our girls to grow up to be good wives and mothers, while others of
us want our girls to grow up to be good human beings (and let them subsequently
choose the specific roles they want to assume). In any case, there is conflict among
our goals, partly because we live in a pluralistic society and different groups have
different desires, and partly because within ourselves we have not thought through
adequately what it is we want to do.

I will use "objective" to denote the short-term destination, each of the
milestones we must pass in order to reach the goal. Objectives are important because
without them we have no useful way of knowing whether we are on track. When we
look at objectives, we see the conflicts among the goals in even clearer focus. We
want our children to be cooperative, generous, and independent (one set of goals),
and we also want them to have good school spirit (another kind of goal). We set up
an objective to have a first-rate high school or college football team that will beat
the opposition, deny the opposition the chance to score; and all this in the name of
generating "school spirit." But we ignore the effects on the other goals, generosity
and independence, and we don't even consider the question: If the goal is
cooperation, with whom should we learn to cooperate: our friends on our football
team or the strangers from the other school's football team?

Of course, it is easy to point out these inconsistencies between and among goals
and objectives. We want good health for all our students but put a vast share of the
money for our physical education program into the needs of a few students so we
can have good school teams. We want our students to learn to be good citizens, but
we give them a school so organized that they get little opportunity to practice the
skills of a good citizen. Actually, I think our objectives are much nearer to the
reality we want in our society than the nice-sounding goals. Do we really want our
citizens thinking critically, behaving independently, working creatively? What
would this do to our government and our factories? (Maybe this should be thought
of as a cynical aside.)

Which brings me to the point of considering the tests we present to our children.
These, for most students and teachers, are the real expressions of our goals and
objectives. And here there is even worse cause for disquiet. We test what we find
easy to test. We find it easiest to test for achievement of factual knowledge and
relatively easy to test for the achievement of understanding of that factual
knowledge. But even here there is an inherent problem that we must face: the
problem of validity. Are the achievement tests we use actually measuring the
knowledge and skills we think they are? Just because a test is labeled "mathematics,"
is it measuring achievement in the mathematics we are teaching in the
schools? Are the reading tests, whose results are so widely published (and so
outrageously fudged), in fact testing reading achievement? They may be, but they
may also be assessing anxiety level, ability to follow complicated directions,
vocabulary level, and perceptual skills. Have you ever looked critically at a reading
test? Are the directions appropriately written for a third grader? Is the test timed?
What kinds of questions are asked? Are they related to the instructional objectives
and educational goals? How many items are provided on the one page? We simply
don't look hard enough at the test items and the tests from which we make our
generalisations about school achievement. Thus, we think it is relatively easy to test
for cognitive and academic achievement, and it is. But it is not as easy to do these
things well.

Attitudes and Feelings

We find it even more difficult to assess attitudes and feelings, and we often ignore
completely the need to assess non-test behavior. We often work on the tarnished
principle that if students know what to do, they will want to do it. But we know
that teaching students so that they know and comprehend what is in the Bill of
Rights will not necessarily mean they will support the Bill's precepts when it comes
to specific issues. And we know that knowing the story of the good Samaritan does
not ensure that those so knowledgeable will behave like the good Samaritan. Why
do we teach children civics? So they can show they know what is in the lessons? Or
so they will feel and behave in certain ways? As your friendly Blue Cross and Blue
Shield association says: "There's more to good health than paying the doctor's
bills." So, too, there's more to good education than passing tests of knowledge and
comprehension. We have means of assessing other domains, but we tend to neglect
the use of these means.

Just as I was finished writing this speech, I got a letter from my nephew in 
Australia and it was so appropriate I thought I would introduce it here (in part, of 
course). 

Dear Uncle Sam, 

I'm enjoying school but I don't like the school work.

Once a week we have civics lessons. Miss Smith tells us about some bloke that
was real good. I like the stories but I can't remember everything about them, and
I get the questions wrong at the weekly test. I can never remember what country
Father Damien came from or what year it was when Albert Schweitzer was born.
It really is a big problem. Miss Smith told me I was a bad boy and I'd never make
a good citizen unless I pulled my socks up. Well, it's true I hate civics, but I do
want to be a good citizen. I hate civics more since she kept me in to learn stuff
about Florence Nightingale yesterday and that made me real mad because I'd
promised to help the Matron at the Wagga Wagga Methodist Home for Orphans.
I'm going to make some blocks for the little kids. The man in the hardware is
letting me have some paint, and I've organized some of the other kids at school to
help me do the painting. I've got an idea about making other toys with three ply
and fretsaws after we've made the blocks. I sure hate civics, though. I have shop,
too. Last week we had to draw a capital L in a book and then do it a second time
on coloured paper and cut it out. I ruined my job because in the drawing book I
put my base line one inch up from the bottom instead of two inches. So I got an
F for shop.*

*This letter is based on "The Poor Scholar's Soliloquy" by Stephen M. Corey, published in
Childhood Education, Vol. 20 (January, 1944). Pp. 219-220.

So where are we going? Hans Christian Andersen pointed out truly that he did
not know where he was going. I doubt whether, despite all the work carried out on
goals statements (and they are very numerous), we understand where we are going
in education either. Part of the reason is that there are external pressures that will
put us off course anyway, rather as the king moved Hans off course. But another
part of the reason is that we have been rather soft-headed. We have not confronted
the moral, ethical, psychological, social, and political issues that arise when we look
at the conflicts among legitimate goals in a pluralistic society. We glory in comprehensive,
melting-pot high schools with common cores in the curriculum for all
students and then point out how good it is to individualize instruction.

And where many think we are going (our goals) may not be where we are in fact
going because our objectives simply may not be attuned to our goals. As a famous
attorney general of the United States, John Mitchell, once said, "Judge us not by
what we say, but judge us by what we do." By the same token, look at what is
happening to find out where we are really going in the schools. (Do not look too
long at where we say we are going.) The fact is that the reasons for the trip to the
steel works for class 6B in I^SS could have been best seen by looking at what was
happening in the classroom (the daily objectives in operation). I still think that
inspector wasn't exactly fair.

And finally there's Moby Dick and my Australian nephew and the problem of
tests and assessments. And how will we know when we have reached our goals and
objectives? While I admit that Moby Dick and my Australian nephew provide
examples of the worst kind, they also suggest better ways. We can assess an
appreciation of Moby Dick better than our rigorous teacher, and my nephew has
presented, unwittingly, some ideas on indicators we could use to test the
effectiveness of our education.

As I see it, I have raised a number of important educational questions and 
problems about goals, objectives, and tests. I have not answered these questions nor 
have I solved the problems. To do so at this point would hardly be fair. After all, 
we have the rest of this day, five more speakers and eight more discussants to go. 
And even if we cannot answer and solve them all in this time and with these
resources, maybe there is some good in having raised them anyway. It might point 
the way to where we should be going. 






Evaluating Educational Programs 

By Samuel Messick



There are two major reasons why we might want to evaluate educational programs.
One is for purposes of program improvement and system renewal: to gain increased
understanding of processes both producing and impeding effectiveness so that
rational changes might be made to improve program functioning. The other is for
purposes of accountability: to gain information about the relative effectiveness of
programs so that rational administrative decisions might be made in allocating
resources.

Not coincidentally, there are also two methodological approaches to evaluation
extant in the technology of the field. One is called formative evaluation, or evaluation
for program development and betterment. The other is called summative
evaluation, or overall appraisal of a final product. Given two approaches and two
purposes and the obvious congruence between formative evaluation as an approach
and program improvement as a purpose, it is easy to fall into the position of
accepting summative evaluation as the natural approach for the other major
purpose of evaluation, accountability.

In this paper I will argue against this tendency to tie accountability to the overall
appraisal of program outcomes, especially as typically implemented in what might
be called the economic model of program evaluation, which focuses upon gains or
input-output differences relative to cost. It is true that this approach does enable us
to ascertain the degree to which a program works and to cull out those programs
that do not work or are not worth the cost. What better basis could we have for
determining accountability? But the time perspective is wrong for constructive
accountability. This summative approach is oriented toward the past. It tells us that
a program was relatively cost effective and hence is probably worth continuing,
provided, of course, that circumstances do not change. In this framework,
accountability is primarily negative: educators may be held responsible for what
happened, but they are not given much help in determining why it happened or
what to do to rectify matters.

What is needed is an approach to accountability that is oriented toward the
future, that goes beyond an overall assessment of the size of effects to an investigation
of the processes that produce the effects. Only an understanding of these
processes will provide a rational basis for changing programs to improve
effectiveness and to accommodate to changing conditions.

What is needed is an accountability system that views decision making as problem
solving, that provides an empirical groundwork for diagnosing inadequate educational
progress and for prescribing plans for corrective action. In this framework,
accountability is basically positive: educators are held responsible for taking corrective
action when goals are not achieved. Such an approach has been suggested in the
ETS design for an accountability system for the New York City school system (1).

From the present perspective, educational evaluation should involve research on
both process and outcome in order to accrue information relevant for both program
improvement and administrative decision making. At first glance this may seem like
a blurring of the useful distinction between formative and summative evaluation,
and to some extent it is, but there really is no fundamental conflict involved. The
terms formative and summative refer to different roles of evaluation, not to
different forms of evaluation, and data from a particular evaluation study can and
usually should play both roles. Indeed, what is needed is a system of continuous
evaluation entailing recurrent formative and summative elements to provide an
ongoing basis of current understanding from which to make rational and adaptive
changes. Such a system, as we shall see, must include a comprehensive measurement
and information program that goes beyond a consideration of educational treatments
per se to an assessment of the broader social context within which those
treatments operate.



The Need for A System Perspective 

Inherent in the concept of accountability is the notion that the individual who is
accountable is a causal agent. But educational effects are multiply determined and
sometimes overdetermined. Given multiple causes, there are also likely to be
multiple agents responsible for these effects, whether desirable or undesirable. No
equitable system of accountability can ignore these complexities and interdependencies.
In all fairness, you can only hold a person accountable for what he
does or fails to do, not for what someone else does or fails to do. This phenomenon
of multiple causation of educational outcomes introduces considerable complication
into our efforts to understand program functioning, for it requires us to
unravel the sources of causal influence in assessing responsibility for outcomes. This
problem highlights the need for a system perspective in educational evaluation.

At this point, then, it may be helpful to make explicit some properties of the
educational system that are usually either taken for granted or overlooked. To
begin with, the educational enterprise is, indeed, a system in the technical sense of a
complex set of elements that functions as a whole by virtue of the interdependence
of its parts. There are many parties to the system, and many parts and processes.
The system is composed of differentiated but overlapping subsystems that embrace
the student, the family, the community, and various peer groups, as well as the
school, the teachers, and the programs. Since the concept of system implies a
functioning whole whose various elements and subsystems are interdependent, it
follows that a change in one part of the system may produce unanticipated and
possibly adverse consequences in another part of the system. Thus, it is not enough
to evaluate an educational program solely in terms of its expected effects; we must
also assess a wide range of possible effects. Indeed, the side effects of an
educational program are frequently more important than the intended effects.

And the situation is even more complicated than this, for education or schooling
is not only a system but a culture. The existing features and regularities are
determined and maintained by roles and expectations, attitudes and values, tradition
and history (2). Under these circumstances, an understanding of program
effects requires not only knowledge about person and program variables but also
about social and institutional variables.




One implication of this recognition of complex system interdependencies is that
educational evaluation should routinely adopt a multivariate interactional strategy:
it should employ multiple measures in each domain or subsystem and methods of
analysis sensitive to interacting influences (3, 4). Within this framework, research
questions become elaborated from straightforward comparisons, like "Is this new
program more effective than the old one?" to something more complicated, like
"Do these programs or program components interact with personality and cognitive
characteristics of the students or with factors in their educational histories or
family backgrounds to produce differential effects upon growth?" Note that such
an elaboration is important, for if trait-by-treatment interactions occur (that is, if
different people respond in different ways to different kinds of programs), or if
background factors turn out to moderate program effects, then the typical simple
comparison of average gains for different groups will very likely be misleading.
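The point about misleading averages can be made concrete with a small numerical sketch. The gain scores, the program labels "old" and "new," and the "high_anxiety" and "low_anxiety" trait groups below are all invented for illustration; nothing here comes from an actual evaluation study. The sketch only shows how two programs can have identical average gains while differing sharply once a student trait is taken into account.

```python
# Hypothetical gain scores (post-test minus pre-test); all numbers are invented.
# Each record is (program, student_trait, gain).
records = [
    ("old", "high_anxiety", 6), ("old", "high_anxiety", 8),
    ("old", "low_anxiety", 6), ("old", "low_anxiety", 8),
    ("new", "high_anxiety", 1), ("new", "high_anxiety", 3),
    ("new", "low_anxiety", 11), ("new", "low_anxiety", 13),
]

def mean(values):
    values = list(values)
    return sum(values) / len(values)

def mean_gain(records, program, trait=None):
    """Average gain for one program, optionally restricted to one trait group."""
    return mean(g for p, t, g in records
                if p == program and (trait is None or t == trait))

# Simple comparison of average gains: the two programs look identical.
overall = {p: mean_gain(records, p) for p in ("old", "new")}

# Breaking the same data down by trait reveals the interaction.
by_trait = {(p, t): mean_gain(records, p, t)
            for p in ("old", "new")
            for t in ("high_anxiety", "low_anxiety")}

# Interaction contrast: the new-minus-old difference for low-anxiety students
# minus the new-minus-old difference for high-anxiety students.
interaction = ((by_trait[("new", "low_anxiety")] - by_trait[("old", "low_anxiety")])
               - (by_trait[("new", "high_anxiety")] - by_trait[("old", "high_anxiety")]))

print(overall)       # both programs average a gain of 7.0
print(by_trait)      # new program: 2.0 for high anxiety, 12.0 for low anxiety
print(interaction)   # 10.0
```

In this invented case a comparison of overall means would report no difference between the programs, although the new program helps one group and hurts the other. A real analysis would of course need many more students, multiple measures, and a significance test for the interaction.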

In contrast to the economic model of evaluation mentioned earlier, this systems
approach has been characterized as a medical model, in that it emphasizes side
effects as well as intended effects and an understanding of causal processes as well
as the gauging of treatment outcomes (5).

Some might object at this point that this particular view of educational evaluation
is overly elaborate and complicated, that with all this emphasis on process and
interacting influences it runs the risk of subverting the main aim of evaluation,
which is an appraisal of outcome or product. This latter stance has come to be
called in some circles the "butter-wrapping model" of evaluation (6): if you have
taught someone how to wrap butter, the main thing you want to know is how
many pounds of butter he can wrap. No matter if different processes of butter-wrapping
are employed by different butter-wrappers (one may use his right thumb,
another his left forefinger). What matters is how many pounds of butter he can
wrap after the training experience. But suppose one very effective butter-wrapper
has gained his speed and efficiency because of a particular stylistic quirk; namely,
wetting his thumb with his tongue as he picks up the waxed paper. Would you be
very happy with that particular style of butter-wrapping, even though it produces
more wrapped butter than any other? At least in some cases, then, we must take
into account factors of style and process in order to evaluate the very desirability of
the outcome.



Continuous Evaluation: A Basis for Adaptive Action 

Ideally what is needed is a systematic basis for understanding not only the change
processes operating in programs per se, but the moderating factors and interactive
forces operating in the broader context of the educational system. We need
methodologies for objectively determining and describing basic characteristics of
system functioning and for evaluating both the changes in those characteristics and
the efficacy of the change process in relation to intended and unintended outcomes
of the system and of intervention programs.



One possibility would be to introduce into school systems a continuous program
of information collection and analysis to uncover existing regularities and processes
and to monitor their perturbations over time. Such an information and assessment
program would have to be comprehensive enough to assess a wide range of student
outcomes representing intended and unintended consequences of the educational
system, as well as characteristics of teachers, students, administrators, programs,
and settings that might interact with each other to produce differential results. It
would also be particularly important to include some provision for observing and
documenting those most critical of all regularities occurring between teacher and
student in the classroom. Given the cultural basis of much system functioning, it
would be vital to incorporate procedures for assessing the attitudes and values, roles
and relationships, perceptions and expectations, and aspirations and goals of the
various parties to the system and to attempt to clarify dimensions of consensus and
contention permeating the social matrix.

With such a multiplicity of components, it is obvious that the program would
have to include an analytical methodology capable of multivariate and longitudinal
comparisons to provide mechanisms for disentangling the threads of
multiple and interacting influences. Ultimately such an assessment effort should
attempt to go beyond the description of system functioning as it exists to the
development of causal models relating prior conditions and processes to outcomes,
so that the potential consequences of alternative educational programs
could be systematically anticipated.

A new instructional program or intervention project could be introduced into an
educational system being monitored in this fashion with prior warning about
existing social pressures and with periodic feedback enabling adjustments and
improvements in program procedures. The impact of the intervention could then be
systematically evaluated, in straightforward if not routine manner, in terms of
changes in measures of system characteristics, especially student outcomes.

In the accountability system proposed for the New York City schools (1), for
example:

The first step is to gather data on as many school and community characteristics
as possible. Two criteria are used for selecting these variables: (a) they must be
factors the school system can do something about; (b) there must be reason to
expect that changing them will affect student learning. These are called process
variables because they describe processes that affect instruction and learning. The
process variables suggested for initial study fall into five categories: characteristics
of the school's staff, characteristics of the school's students, characteristics of the
school's programs, characteristics of the facilities, and characteristics of the
community. To illustrate: a staff characteristic might be ^cational experience; a
student characteristic, reading habits; a facilities characteristic, age of school
building; a program characteristic, total expenditure on programs; a community-district
characteristic, percent of teachers with more than three years' experience.






A comprehensive educational system information program of the type envisioned
here represents an ideal that would require considerable effort to realize. It offers
the advantages of a wide range of information bearing on administrative and
instructional decision making, program development and evaluation, and accountability,
and in the long run could serve as a vehicle for continuing research on the
functioning of educational systems. As a continuing program, it would also provide
an early warning system signaling the appearance of new types of students and new
or changing conditions, thereby permitting timely modifications in programs to
accommodate to new inputs and circumstances. Its major disadvantages are the
time and resources required to develop, implement, and maintain it. Other potential
difficulties derive from the fact that such a comprehensive assessment program
would itself become an important part of the system it is meant to monitor and
hence might influence or modify that system in unforeseen ways. Thus, even
techniques introduced to monitor system functioning and to facilitate informed
decision making may produce more pervasive effects and should themselves be
examined in terms of system interdependencies, in particular with respect to the
psychology, economics, politics, and ethics of information collection and use.

This concept of continuous evaluative research is intended to provide, at the
system and program level, a rational basis for adaptive action. In addition, however,
much of the information involved in this ongoing evaluative function, such as
measures of pupil progress, could serve both evaluative and instructional purposes.
Thus, if systematically implemented with feedback at the teacher and classroom
level, this conception could help make evaluation an integral part of the educational
process itself.



References 

1. McDonald, F., & Forehand, G. "A Design for Accountability in Education." New York
University Education Quarterly, Vol. 4, 1973. Pp. 7-16.

2. Sarason, S. B. The Culture of the School and the Problem of Change. Boston: Allyn &
Bacon, 1971.

3. Messick, S. "The Criterion Problem in the Evaluation of Instruction: Assessing Possible, Not
Just Intended Outcomes." In M. C. Wittrock & D. E. Wiley (eds.), The Evaluation of
Instruction: Issues and Problems. New York: Holt, Rinehart, & Winston, 1970. Pp. 183-202.

4. Shulman, L. S. "Reconstruction of Educational Research." Review of Educational Research,
Vol. 40, 1970. Pp. 371-396.

5. Messick, S. "Evaluation of Educational Programs as Research on Educational Process." In
F. F. Korten, S. W. Cook, & J. I. Lacey (eds.), Psychology and the Problems of Society.
Washington: American Psychological Association, 1970. Pp. 215-220.

6. Wittrock, M. C., & Wiley, D. E. (eds.), The Evaluation of Instruction: Issues and Problems.
New York: Holt, Rinehart, & Winston, 1970.




Statesmen, Scientists and Soothsayers:
Forecasting Assessment Needs in Higher Education

By K. Patricia Cross 



The exhilaration and frustration of the assessment business is due in part to the fact
that evaluators exist on the cutting edge of change. Agencies such as ETS are
shaped by, and in turn help to shape, the directions of educational innovation.
Because it takes a long time to develop methods and tools for assessment, we must
often predict educational directions before they are fully articulated by the educational
community. On the other hand, we can't be so visionary that we develop
materials educators are not ready to use. The tension between the push and pull of
responsiveness and leadership is constant, and at the practical level, timing is
critical. In the past, ETS has made errors of being both too late and too early. But
we have also had our share of the sort of good timing that serves the educational
community well. Let me give one example of a test development program that was
started far enough ahead of the demand so that it was well developed by the time
educators were ready to use it.

Ten years ago, in 1963, ETS started the development of what has become the
College-Level Examination Program (CLEP), designed to help adults achieve credit
for learning outside the classroom. Not so incidentally, that was also the year when
Harold Wilson made a bid for the intellectual vote in England by proposing the
Open University. Open University, as you may know, is designed to provide
educational opportunity for thousands of Britishers who never had the opportunity
for education beyond the secondary school. It concentrates on delivering education
to people who, because of life circumstances, cannot study full-time on a college
campus. The mission of Open University is educational opportunity, and the
attractiveness of that concept was fully apparent but not universally articulated in
1963.

The existence of such a successful model as Open University, however, helped to
stimulate worldwide interest in external study and lifelong learning. Assessment of
out-of-class learning is a crucial accompaniment of the external degree, and the
timing of CLEP helped immeasurably to spread the acceptance of credit-by-examination
in this country. Development of the program, remember, had been
quietly under way for five or six years before nontraditional study and the external
degree became a red-hot innovation. Because of good timing, the CLEP exams were
ready to meet rising demands for credit-by-examination and to stimulate even
greater interest in proficiency testing by their very availability. Today, proficiency
on the CLEP exams is accepted by two-thirds of the colleges in the country as an
alternate to classroom experience.

Who knows what the ingredients for successful timing really are? Actually, the
CLEP tests were probably a little ahead of their time. They were devised because
they seemed a good and educationally sound idea, not because there was a readily
visible market. But we could also cite some examples of good educationally sound
ideas that never caught on. ETS was ready to provide tools of assessment that
would help education, but there was no adequate market for them.

Testing agencies frequently serve a fluctuating role between stimulating innovation
and responding to it. The availability of assessment tools makes possible some
reforms, but the best tools in the world won't spark a reform that educators are not
ready to implement. Those who worry about testing agencies dictating educational
directions have never tried to keep a program afloat in a nonprofit corporation until
educators wanted to use it.

I think I can make the case that timing for ETS requires the presence of scientists,
statesmen, and soothsayers in the organization. Professionals at ETS must play the
role of educational statesmen and reformers by recognizing educationally sound
ideas and helping to influence the development of the tools that would make
possible their implementation. Our role as scientists and educational researchers is
to understand the implications of data that extend the frontiers of knowledge
about the educational process. And our role as soothsayers is to recognize, even
before they articulate it, what colleges and students are going to want. My personal
focus at ETS has been on serving students. As researcher, I need to know what they
want. As statesman, I need to know what they need. And as soothsayer, I need to
know what educators are going to give them.

Determining What Students Want

Let us talk first about the role of scientists or researchers at ETS in determining
what students want in education. Dick Peterson, a research colleague in the
Berkeley Office of ETS, conducted some research last year at the request of the
California State Legislature (1). He administered the Institutional Goals Inventory
(IGI) to faculty, students, administrators, trustees, and citizens of the local
communities of 116 private and public colleges in California. Each constituent group
was to rate their college as it is and as it should be on 20 different institutional
goals. The results indicated that students in the various types of colleges were
notably dissatisfied with two aspects of their present education. The extent of their
dissatisfaction was shown by the discrepancy between the is and should be ratings
of their college.
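The is/should-be comparison described above amounts to simple arithmetic, which the following sketch works through. It is only an illustration under stated assumptions: the four goal labels, the 1-to-5 rating scale, and every number are hypothetical, and the code is not a description of how the IGI was actually scored. For each goal it computes the discrepancy (should be minus is) and the goal's rank under each set of ratings.

```python
# Hypothetical mean ratings on an assumed 1-5 importance scale.
# Goal labels and all numbers are invented for illustration only.
ratings = {
    "academic development": {"is": 4.2, "should_be": 3.1},
    "intellectual orientation": {"is": 2.9, "should_be": 4.3},
    "personal development": {"is": 2.6, "should_be": 4.1},
    "vocational preparation": {"is": 3.4, "should_be": 3.6},
}

def rank(ratings, key):
    """Rank goals 1..n by mean rating under `key`, highest rating first."""
    ordered = sorted(ratings, key=lambda goal: ratings[goal][key], reverse=True)
    return {goal: position for position, goal in enumerate(ordered, start=1)}

is_rank = rank(ratings, "is")
should_rank = rank(ratings, "should_be")

# Discrepancy: positive means respondents want more emphasis than they see now,
# negative means they want less.
gaps = {goal: round(r["should_be"] - r["is"], 2) for goal, r in ratings.items()}

for goal in ratings:
    print(f"{goal}: gap {gaps[goal]:+.1f}, "
          f"rank {is_rank[goal]} (is) -> {should_rank[goal]} (should be)")
```

With these invented numbers, "academic development" falls from first place to last, while "personal development" shows the largest positive gap. The sketch has only four goals, whereas the study described here ranked 20.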

Highly able students at eight campuses of the academically selective University of
California felt that too much attention was given to the acquisition of knowledge in
the academic disciplines. They said that, as a goal, academic development of
students presently ranks third in the list of university priorities whereas they would
demote it to 14th place in the list of 20 institutional goals. Their fellow students in
the community colleges agreed that even in their less academically rigorous institutions,
the emphasis on academic content was greatly overdone. They would demote
it from its present first priority to ninth. When students across the broad spectrum
of academic accomplishment reject so emphatically the major activity of colleges
and universities, we have some thinking to do. Are students failing to perceive the
importance of our efforts, or have we lost our perspective about the mission of
higher education?

It is quite clear that students are not expressing anti-intellectual attitudes. They
are strongly supportive of activities that place more emphasis on the development
of the appreciations and skills of learning. They recognize as important such goals
as these: instilling a commitment to lifelong learning, creating the desire for
self-directed learning, and developing in students the skills of analysis and
synthesis. The message that seems to be conveyed by this research is that we
need to give more attention to the development of intellectual skills and attitudes
and perhaps less to the simple purveying of information. Thus, as scientists
we know something about the desires of students and their frustration with their
present education.

As statesmen we can only agree that education for content mastery is a very
short-range objective. With the speed of change today, content knowledge is
outdated by the time a student moves from freshman to senior, even assuming that
professors are up-to-date on the newest developments in their field. Instilling a love
of learning and a flexible response to change, rather than content mastery, should
be the fundamental task of a college education. Difficult as the project is, the
educational leadership must begin to tackle that job, and we need the tools of
assessment in order to do it.

Finally, as soothsayers, we need to make some guesses about the educational
programs that colleges are going to provide. One way of injecting some science into
our prediction of what educational institutions are going to do is to look at the
priorities of the power structure (the faculty and administration) of colleges.
Referring again to Peterson's data, it is clear that almost everyone (faculty,
students, administrators, and lay community people) would give the development
of broad intellectual skills and interests higher priority than they now have. Thus, it
is a very good guess that if we, as evaluators, could develop convincing measures of
these attributes, there would be a ready market for them, and we would be
contributing to improved educational practice.

The future of tests directed toward assessing content mastery in the academic
disciplines is not so clear. While students are obviously dissatisfied with the present
overemphasis on content mastery, faculty, administrators, and governing board
members of the universities of the California system fail to see any huge
discrepancy between what is and what should be in this area. Thus, a valid
short-range prediction would seem to be that traditional academic achievement
tests will enjoy continued use in the universities because the people who determine
educational programs there value content mastery and emphasis on knowledge in
the academic disciplines.

The situation in the community colleges, however, is quite different. Community
college faculty and administrators are almost as disillusioned about the goal of
academic development as students are. Their top priority goals are the creation of a
feeling of community on campus, the vocational preparation of students, and the
personal development of students. The academic development of students is given
only a 12th-place ranking by both faculty and administrators in community
colleges today. As soothsayer, then, it would seem to be a reasonable prediction that
community colleges will lead in de-emphasizing assessment of academic content
mastery in favor of acceptance of a greater range of tools of assessment.






K. Patricia Cross



Attending to Personal Development

Let us turn now to another source of student dissatisfaction with present
educational programs. Students in all kinds of colleges are expressing a marked
dissatisfaction with the present state of affairs regarding attention to the personal
development of students. Personal development is defined by the IGI as helping
students to do the following things: identify and pursue the accomplishment of
their personal goals, develop a sense of self-worth and self-confidence, achieve
deeper levels of self-understanding, and develop open and trusting relationships
with others. University students claim that the university gives this goal 18th
priority whereas it should rank fourth. Community college students think attention
to student development is even more important; they would rank it second
in the list of priorities. Community college students get a great deal of support
from their faculty and administrators for greater attention to noncognitive
growth as a responsibility of the college, but university students get very little.
University of California faculty would grudgingly move emphasis on personal
development from its present 14th place only to 10th place, and administrators
are only slightly more interested. The power structure of the university thus
stands in marked contrast to the students, and to lay members of the community,
who agree with students about the importance of personal development as a
function of a university education.

As soothsayer, I predict that evaluators could be quite helpful to community
colleges by developing some good measures of noncognitive growth. While I see no
immediate market in the universities, I might go so far as to say that noncognitive
measures are a good bet for a long-range forecast even in the universities because
the present dissatisfaction of students, supported by the general public, is likely to
have an impact on the bastions of university traditionalism. If the measures prove
successful in the community colleges, there is every probability that they will be
desired by four-year institutions.

So far I have talked mostly about the changing needs of the present educational
community. My analysis has been applied to students currently in the
system. Now I would like to take a look at students presently outside the
system but waiting to get in. For if higher education has any growth potential at
all, it is in those segments of the population previously unserved by higher
education. The present leveling off of college enrollments is not a temporary
phenomenon. The actual number of five-year-olds dropped 15 percent between
1960 and 1970. These are the potential college students of 1982. The number of
births dropped 12 percent between 1970 and 1972. These are the potential college
freshmen of 1990, and the nation's birth rate is now slightly below zero population
growth (2). In other words, the college students of the next 17 years have already
been born (or, of more significance to my argument, not born), and there is no
expansion possible except to pick up some of the 40 percent of high school
graduates who do not presently enter college or to serve the growing adult market
of higher education.




Forecasting Assessment Needs



Looking at the New Students

An analysis of students new to higher education shows these groups on the rise:
1) Students from the second and third academic quartiles of the high school
graduating classes. These students are, of course, flooding into open admissions
colleges. As a matter of fact, in the past two years, 85 percent of the increase in
the number of first-time students entered community colleges. 2) The proportion
of ethnic minorities and women is on the rise, whereas the proportion of white
males entering college immediately following high school graduation is on the
decline. It dropped from 63 percent in 1968 to 53 percent in 1972 (3). 3) The
proportion of adults returning to college is on the rise, and if the nontraditional
studies movement continues its precipitous pace, this will be the most rapidly
increasing segment in all higher education. So let us look at what these new learners
want, because in a sense they are in a good position to get what they want. It is
now a buyer's market, and colleges are likely to provide the programs that will
attract the new learners.

First, let's look at the group that I, in my own research and writing, have called
new students to higher education. These are primarily low academic achievers
admitted to higher education by open admissions. Despite all the publicity given to
ethnic minorities as new students, it is really the low-achieving white sons and
daughters of blue-collar workers that are the chief beneficiaries of the movement of
full equality of educational opportunity as reflected in the community colleges. As
a matter of little-known fact, about two-thirds of all the students in remedial
education in community colleges are white, not black. These students are not in
college for the love of learning or for the liberating influence of higher education;
they are entering community college because they see it as a pathway to better
jobs. They are not eager for liberal arts education except as it will give them
respectable credentials, and they are heavily career-oriented.

The career orientation of these groups of low academic achievers is matched by
ethnic minorities and women. They, too, are primarily interested in access to higher
education as an entree to the mainstream of American economic society.

Finally, it is quite apparent from research collected by ETS on the interests of
out-of-school adults between the ages of 18 and 60 that adults want an education
that is useful. They will be attracted to college if the education provided helps them
to develop the competencies that will be useful in their daily lives: to get a better
job, to invest money wisely, to cook and garden and raise children, to repair things
around the house, and to enjoy their leisure time.

The groups of students that represent the growth potential for postsecondary
education, then, are interested in learning how to do things, as opposed, perhaps,
to how to think about things. This suggests, to the scientists among us, a need for
greater attention to what has been called competency-based education. If we can
develop the tools that will tell students and employers what kinds of tasks people
can perform, we will be meeting a need that is already emerging as a particular
result of the increasing proportion of new students in colleges.




As soothsayers, we may be a little late in marshaling our resources to meet this
need, which has already been articulated by the educational community. But as
statesmen, we recognize the educational soundness of education and evaluation to
help people improve their performance toward a specified goal, as opposed to
education and evaluation that is basically competitive and tells people where they
stand with respect to others. Normative testing is an essential tool for selective
purposes, and it served selective admissions in the meritocratic years of the 1950s
and '60s very well. But supply in higher education has now caught up with demand,
and in our new egalitarian decades, educators are shifting attention from the task of
selection to the job of education. Thus, assessment is moving out of the admissions
office and into the classroom. In serving the new students to higher education, we
have an example of the dual role of evaluation: to be responsive and to demonstrate
leadership. Ideally, we will be responsive to the new interests in the educative
process, and we will offer the tools and services that will help to lead education to
new levels of accomplishment.



References 



1. Peterson, R. E. Goals for California Higher Education: A Survey of 116 Academic
Communities. Prepared for the Joint Committee on the Master Plan for Higher Education,
California State Legislature. Berkeley: Educational Testing Service, 1973.

2. Glenny, L. A. "The '60s in Reverse." The Research Reporter, Vol. 8, No. 3, 1973. Pp. 1-4.

3. Young, A. "The High School Class of 1972: More at Work, Fewer in School." Monthly
Labor Review, June 1973.






Assessment for Career Decisions and Roles

By Winton H. Manning




A turn of a phrase that is strangely jarring to the reader sometimes captures the
rapidity of change in our way of life in a way that paragraphs or pages cannot do half
so well. So it is that at least once or twice a month an obituary in the New York Times
for some fallen business or professional leader will say, and I quote an example:
"Mr. Smith prepared at St. Paul's for Yale, Class of 1924, and completed his
education at Yale Law School in 1927, whereupon he entered the practice of law in
New York City."

Few concepts are embedded as deeply in our culture as the notion that work is
the object of life. Consequently education has been regarded as preparation for
work, or even in the not too distant past for a career befitting one's "station in
life." Hence, as Robert Hutchins said, "(Education) has been thought of as a
children's disease; having had it once, you need not, in fact you cannot, have it
again. This attitude has been reinforced by the organization of educational systems
into stages: as each stage is reached, the one that is left behind is 'finished.' And, if
an education is regarded as instrumental (to a job, marriage, a degree), its purpose
has been fulfilled when its object is attained (1)."

Hutchins goes on to quote Margaret Mead on this same point: 

"...our educational institutions are places where we keep children for a shorter or
longer period...Once they have left, we regard them as in some sense finished,
neither capable nor in need of further education, for we still believe that
education should come all in one piece, or rather in a series of connected pieces,
each presented as a whole. ...Thus we avoid facing the most vivid truth of the new
age: no one will live all his life in the world into which he was born, and no one
will die in the world in which he worked in his maturity (1)."

What does this mean for educational institutions and programs? For Hutchins and
others, it means radical reformulation of our ideas about when, where, and with
whom education takes place. In short, if we are to live satisfying lives in the decades
ahead, we must somehow create a "learning society."

Educational measurement, as a servant of the educational process, is necessarily a
victim of the same dogmas of the past that dominate popular conceptions of our
educational system. If education is seen as a "series of connected stages, through
which one successively passes," it is no accident that examinations have served
primarily as bridges from one stage to another, probably culminating for the
recently deceased Mr. Smith in his bar examination, after which he could consider
himself finished with both education and tests. How different such a life-style is
from that of many persons now in their maturity, to say nothing of young people.

Just two weeks ago, for example, my wife received notification from the State of
New Jersey that she is at long last licensed as a clinical psychologist. Stretching
back from that official letter, which gives her legal sanction to enter into private
practice as a professional psychologist, is a 20-year trail of examination papers,
including college admissions tests, graduate admissions tests, Ph.D. qualifying
examinations, doctoral orals, licensing examinations, and so forth; but, more
importantly, all this was interspersed with babies, three cross-country moves, and
all the vicissitudes of working one's way through an educational system that has
never really seriously evaluated its procedures from the standpoint of a professional
woman in our culture. Her first response to the letter was not, "At last! I'll never
have to take another examination as long as I live!" Rather, her reaction was, "Now
I can get busy learning more about some of the things I'm interested in!" A basic
assumption of this paper is that we are going to see fundamental changes in
postsecondary education over the next 25 years, which will make access to
postsecondary education open, continuous, and more interrelated with employment
and leisure than ever before: in short, a much different world than you and I
have grown up in.

What are some of the ways in which educational measurement can serve to
support this concept of lifelong learning, of a "learning society," to use Hutchins'
term?



Appraisal of Relevant Learning 

Clearly, there is an enormous job to be done in developing sound procedures
concerned with the assessment of learning that goes on in experiential settings, that
is, with competencies that are acquired independently by learners of any age,
through work or other life experience. The boundaries separating formal educational
institutions and the wider world of learning through experience are now
breaking down, and the need is growing rapidly for new approaches to the appraisal
of relevant, demonstrable learning that has been acquired in nontraditional study.
For example, we are now attempting to organize a consortium of about 40 agencies
and institutions who want to work cooperatively, together and with ETS, in
developing measurement services appropriate to these new needs. Even five years
ago such an effort would have been unthinkable.

If I am correct in asserting that the educational system is resolutely moving
outward in the direction of what Hutchins first called a "university without walls,"
it is also changing in another dimension to which I referred earlier, namely,
education is becoming a lifelong experience. Again, Margaret Mead stated the case
well when she said:

Often, only a few months may elapse before something which previously was
easily taken for granted must be unlearned or transformed to fit the new state of
knowledge or practice. In this world, no one can complete an education (2).

The world of Mr. Smith, like himself, has been decently interred. In its place, we
must design new educational arrangements for a new world of learning. That is true
whether we speak of liberal learning necessary to function as a competent citizen,
or of specialized knowledge necessary for a productive, satisfying career as an auto
mechanic, a clinical psychologist, a dentist, or even a journalist.




In this respect, ETS has been actively working with a number of groups who
recognize that the rapidity of change requires continuous access to education. One
of these is the National Institute of Automotive Service Excellence, which sponsors
the auto mechanics certification program. Others are the American College of
Dentistry and the American Board of Professional Psychology, to name a few
examples. In each instance we have worked with specialists in the field to develop
examinations that are voluntarily being taken by practitioners as a means of
self-appraisal. The examinations become, if you will, a framework or grid of
knowledge and competencies against which the auto mechanic or the dentist can
measure himself. The results of these examinations are useful in helping the
individual direct his own continuing education or training, and they also can assist
those concerned with devising programs of training or continuing education that
meet the current needs of individuals more effectively.

In some instances, notably the auto mechanics program, the results of the
examination also can be used as a vehicle through which the individual can
communicate the level of competency he has acquired to the public at large. When
Mr. Smith graduated from Yale, his name was inscribed on the rolls of the Class of
1924 and his degree was conferred with the full panoply of medieval splendor. The
chevrons and certificates awarded to the auto mechanic by the National Institute
constitute comparable recognition, as well they should in a society that dignifies
the worth of the individual and esteems competence however acquired. Furthermore,
it is worth noting that Mr. Smith was never faced with the necessity of
demonstrating in 1934, to say nothing of 1964, that he was keeping up with the
times in order to retain his degrees. Examination programs are, therefore, beginning
to serve as the handmaidens of continuing education within a context of lifelong
learning.



Education for Survival

There is, however, a more fundamental aspect of education for which the rapidity
of change requires that we rearrange our views and restructure the assessment
services that will be supportive of educational processes in a learning society. This is
education for personal development, if not survival, in modern society.

One of the consequences of the American conception of education as preparation
for work is our inability to distinguish between education and training. Thus the
objective of the educational system is to yield trained manpower rather than to
produce educated people. This is reflected in the development of vocational
guidance services in this country, which has relied heavily on the techniques of
aptitude testing and related assessments.

On the whole, vocational guidance testing proceeded on the assumption that
there exists a large but finite number of jobs that can each be described in terms of
the abilities, knowledge, and interests necessary to perform them. The task of
measurement specialists is one, therefore, of sorting or classifying people by these
assessment techniques into the appropriate job-related hierarchies, and then
"guiding" them in the appropriate direction through counseling. The technical
problems can be formidable, but in some applications, such as the military services,
very sophisticated systems have been developed along these lines. When I was a
graduate student, this kind of model was implicit in every industrial psychology
course I took.

Unfortunately, this model of education and of occupational assessment does
not deal well with the dynamic character of jobs today. The facts suggest that in
a large number of occupations the pattern of abilities, skills, and knowledge can
shift continuously as a function of rapid, technologically induced changes in job
demands. Furthermore, as James O'Toole points out in Work in America, the
report of Elliot Richardson's Special Task Force, about three-quarters of the jobs
in this country today require relatively little advanced training, and what is
needed in this regard can be accomplished in two or three days of on-the-job
training (3).

Consider what may be the most symbolic but humble task of mankind: the
baking of a loaf of bread. Hutchins reports the existence of bakeries in Germany
where Spanish women ride to and fro on bicycles, watching signal lights on fully
automated bake ovens that carry on the baking process from the raw material to
packaged loaf, perceived only by electronic sensors (1). Or take the case of the
Lordstown plant, GM's automated showcase, which inspired the prophetic title of
Emma Rothschild's book, Paradise Lost: Decline of the Auto Industrial Age (4).
What are the ability demands of such jobs? More importantly, what are the needs
for education in order to live in such a world?

"Work," my industrial psychology professor told me 20 years ago, "is a form of
activity that has social approval and satisfies a real need of the individual to be
active. To produce, to create, to gain respect, to acquire prestige, and incidentally
to earn money: these are some of the reasons why people work." The lack of some
of these things is, we are told, the reason why young workers struck the Lordstown
plant in February 1972. It also demonstrates the folly of considering education as
simply preparation for a job, and the necessity for considering education as
something more than training.

To return to the point made earlier, the failure of the American educational
system to distinguish between education and training produces too many people
who are not only ill-prepared to cope with the world of work as we now see it
taking shape, but ill-equipped to enjoy the benefits of increased leisure that
technology is beginning to provide. How is this connected to the problem of
developing assessment services that are responsive to the new work environment?
The older model of sorting people into the "right" slots is defeated before it begins
because of the tempo of change. At best it will increasingly come to apply only to
upper portions of the job hierarchy that require complex professional and technical
knowledge. In other words, we have on our hands a vocational guidance model that
has been rendered obsolete by the march of events, and which can justify its
survival only by becoming increasingly elitist in the clientele it seeks to serve.








Guidance in Career Decisions 

At ETS, Martin Katz has been leading an effort to develop a new approach to
vocational and occupational guidance, one which emphasizes the processes of
decision making, and which has as its objectives increasing the competence of an
individual to understand himself, improving his ability to acquire and use
information intelligently, and enabling him to make informed decisions for
himself. Known as SIGI (System of Interactive Guidance and Information), Katz's
approach uses sophisticated computer technology within a framework of humane
educational objectives. SIGI is concerned with education and not training, in the
sense that education is, in the last analysis, concerned with encouraging the habit
of thinking, the capacity to distinguish between the important and the
unimportant, and the development of critical standards of thought and action. In
Martin Katz's words:

Our bias, our conviction, is that in education enlightened processes are
intrinsically important. Therefore we bend our efforts to increase the student's
understanding of the factors involved in choice (imperfect though our
understanding may be) so that he can take responsibility for his own decision making,
examine himself and explore his options in a systematic and comprehensive way,
take purposeful action in testing hypotheses about himself in various situations,
and exercise flexibility in devising alternative plans (5).

New approaches to assessment for career decisions and roles are in the making, and
SIGI is a preeminent example. Career assessment must go hand in hand with
changes in educational systems and in the world of work, all three of which are
inextricably intertwined. Whether there will emerge out of this ferment a new and
exciting period of profound change in the philosophy and methods of education,
assessment, and work is hard to say at this time. But ETS is engaged in the effort to
make assessment serve the individual, thus contributing to this emergence.

Perhaps, in an economy where work as we have known it will become increasingly
less important, it will be possible to implement the sort of system of education
envisaged by Dewey in 1916, when he wrote:




Both practically and philosophically, the key to the present educational situation
lies in a gradual reconstruction of school materials and methods so as to utilize
various forms of occupations typifying social callings, and to bring out their
intellectual and moral content (6).



Hutchins goes further to the point when he adds: 



(Man's) educational institutions have been designed largely to perpetuate
existing values. Ultimate recognition of the facts of life may force reconsideration
of those values and the redirection of education toward new ones. The
first step is general understanding of the facts of life, of the new values that are
now attainable, and of the possibilities and limitations of education in helping to
achieve them (1).

We have a long way to go to reach the objectives set forth by Dewey and Hutchins.
The task of the assessment community is to take firm hold of these new directions
or, like Mr. Smith, look forward to a long, long sleep beneath the sod.



References 

1. Hutchins, R. M. The Learning Society. New York: Praeger, 1968.

2. Mead, M. "Why Is Education Obsolete?" In R. Gross (ed.), The Teacher and the Taught.
New York: Dell, 1963. P. 271.

3. Work in America: Report of a Special Task Force to the Secretary of Health, Education, and
Welfare. Cambridge, Mass.: MIT Press, 1973. Pp. 134-152.

4. Rothschild, E. Paradise Lost: Decline of the Auto Industrial Age. New York: Random
House, 1973.

5. Katz, M. "Exploring Values for Career Decision Making: A Computer-Based System of
Interactive Guidance and Information." In Measurement for Self-Understanding and
Personal Development. Proceedings of the 1973 Invitational Conference on Testing
Problems. Princeton, N.J.: Educational Testing Service, 1974.

6. Dewey, J. Democracy and Education. New York: Macmillan, 1916.






Implications of Assessment for Society

By Melvin M. Tumin 

The delightful vagueness of the global mandate I have been given to assess "the
implications of assessment for society" both requires and permits me to interpret
my task according to my own best lights. I must do so, however, with decent regard
for what I judge to have been the intentions of the organizers of this conference.
They knew as well as I that I am neither psychometrician nor sociometrician. I take
it, then, that I am asked to consider the activity called assessment, in its many
manifestations and contexts, and suggest some obvious, and perhaps some not-so-obvious,
significances of that activity for this culture and society. My questions
thus become: How extensive is assessment as a cultural activity? What functions
does it serve? What does it portend?

Taken in its broadest implications, the term (and the activity) assessment goes far
beyond the boundaries of educational effort. If we focus for a moment on the
diverse functions assessment serves in the field of education, and use those functions
as a checklist for looking at other fields of cultural endeavor, it becomes
apparent that every major institutional nexus of our society is subject to the same
range of assessment efforts. Theatre, poetry, fiction, sculpture, painting, architecture,
dance, and music: in short, all the arts. Fiscal policy, monetary policy,
investment activities, fluctuations in the stock market, growth and fluctuations in
the gross national product, bankruptcies and successes: in short, all the economic
activities. Child rearing, family solidarity and rupture, sex roles, day care centers,
courtship and marriage, abortions and contraception: in short, all the institutions
connected with kinship and reproduction. Similar recitations can easily be made for
politics, recreation, religion, health and welfare, science, stratification systems, race
and ethnic relationships, and international affairs. The entire institutional
framework of our society is deeply involved in assessment activities of all kinds.

What are these activities? They involve measuring what has transpired, predicting
what might transpire, understanding why events have taken place as they have,
determining the weights of different variables that are considered relevant, locating
the significance of events for the society, considering future lines of policy and
action: in short, every conceivable dimension of these institutional activities is a
proper subject for some form of assessment. Add to these the enormous effluence
of "how to" literature, including how to perform the traditionally most private
activities, not excluding the traditionally most tabooed activities. "How to" is, after
all, a fundamental kind of assessment. It involves an understanding of means-ends
relationships, the resources needed to perform a task, the training required to
develop skills, the attitudes requisite to full participation: the whole kit and
kaboodle. Even Linda Lovelace's talents are available for "learning" in various
"how to" manuals.

We can now see this activity of assessment, in all its ubiquity, on quite different
levels of significance, though those levels are not necessarily hierarchically ordered
relative to each other. Assessment represents, first, an attempt to order rationally
and give structural coherence to human activities that otherwise would seem
inchoate, and without form or pattern. Assessment is also obviously intended to
provide humans with somewhat greater control over human affairs, and affairs of
non-human nature, such as floods, earthquakes, and the weather in general. Third,
assessment functions to make it possible for humans to exercise some greater
rationality in choice among numerous competing alternatives (e.g., college courses
and movie offerings). Fourth, assessment, such as is involved in market research,
serves to make it possible to increase the range of choices humans might have
available among future goods and services. Assessment also provides an extraordinary
playground for both common and elite participation in major cultural
activities. The millions of Monday-morning quarterbacks who have expertly
diagnosed the wins and losses in Saturday horse races and Sunday football games
are an example in point. Every man his own expert: a crucial function in a society
in which genuine expertise requires great and prolonged effort.

A Question of Control

If there is one common strain that runs through all these functions of assessment it
is the theme of the management of diversity. But that is not enough. For it is
crucial also, I think, to recognize that the very effort of assessment, the presumption
of assessment, if you will, is the predication of human responsibility for human
affairs, of human activity as the instrument of human destiny, of human capacity
to direct human affairs, of final human control over human fate.

I deliberately repeat and stress the word human to contrast it with an earlier
theme that was dominant just about 200 years ago, when no such presumptions and
predications could have received much public reception. For until 200 years ago
human fate was considered to be subject primarily to Divine guidance and rule,
including, importantly, the variance in human conduct made possible by the Divine
grant of free will. Under that system of beliefs, the actions we call assessment here,
which involve humans peering at themselves and their behaviors, trying to know
about, understand, and see significance in, and to speculate about, futures (all these
manifestations of human self-reflexivity and human self-consciousness) could not
be important in the life of the culture. They became legitimate and possible only
when, at the end of the 18th century, it became increasingly appropriate for
humans to consider themselves as legitimate objects of inquiry.

Seen in that light, assessment may be understood as the modern functional
equivalent of the traditional practice of religious adoration. It is not that
assessments today lead to paeans of praise for and affirmations of the nobility and glory
of human responsibility for human destiny. To the contrary. The dominant tone of
such assessments is often that of despair, even if the fundamental source of the
energy behind the pursuit of ordering and understanding lies in optimism and hope.
Rather, the analogy to religious adoration derives from the realization that
adoration of God's handiwork was the ritualized manner in which the extraordinary
diversity and mystery of human affairs were managed, namely, by ascribing their
totality and significance to Divine intention. So, on the modern scene (that is, from
1800 until now), the dominant response to the new awareness of human
responsibility for human history, and of human triviality in the world of
non-human affairs, has been one which involves increasingly refined efforts to
specify (that is, to assess) the role of humans in their own destiny and, at the
philosophical extremes, to try to create a language whereby the meaning of that
role and of the act of assessment of the role can be formulated and uttered.

We see, then, that what is common to the apparently greatly different responses
of religious adoration and secular assessment is one striking function served by
both; namely, the management of diversity. This is not to say that religious
affirmation and adoration and secular assessment serve only that function. It would
be no problem at all for all of us here to specify another dozen functions which
both adoration and assessment serve. But it is to say that at least a principal
function of both is the management of diversity. Having said that much, one major
difference in the two approaches becomes immediately apparent. Religious
adoration serves that function by attribution of all diversity to an all-powerful Divine
agency, leaving the diversity unanalyzed, unreduced and unordered. By contrast,
secular assessment serves to make diversity manageable by reduction through
classification, typologizing, factor analyzing, correlating, scaling; in short, by
grouping diversities into manageable numbers of types, and measuring, comparing,
and contrasting this reduced number of types so that informed choices can be made
among them.

The introduction of the term choose or choice implies another fundamental
difference between the era of adoration and that of assessment. It is the fact that a
crucial feature that accompanies the transition from the Society of God to the
Society of Man (with apologies to women) is the emphasis on the optionality of
human existence and the possibility of change in the quality of that existence
through human activity. Optionality unavoidably means preference. We can come
to prefer and make such preferences meaningful only when we adopt the
perspective that our fates are not foreordained or preformed or given by a more-than-human
agency. Hence, we have the right to prefer and try to choose. Moreover, we
have the capacity to make such choices, and we have the technology for informing
such choices, namely, through comparative assessment of the varied offerings.

The theme of optionality and choice has a still deeper significance that must be
specified, especially since it is so strongly antagonistic to the notion of Divine
creation of human existence and activity. I refer to the implicit notion that in
modern society humans see themselves not only as agents of their own present
qualities and destinies, but as active creators of their future qualities and destinies
as well. That is, we are not content with understanding the lawfulness of our
present existences. Rather, we insist, too, on the power to create our individual and
collective existences and states of being for the future as well.

That theme, of continuous, intelligent, rational self-creation of individual and
social forms and functions, may be said to be central to educational assessment
activities. For better or for worse, we seem quite fully caught up in the idea that we
must understand human variability, not only to understand it, but to see what can
be done about reducing it, or at least reducing its consequences when we consider
those consequences undesirable. So, too, we have visions of a desirable society: its
work force; its human relationships; its goods and services; the quality of life for its
citizens; the psychological states of those citizens, including their levels of anxiety,
creativity, utilization of intelligence, and affective well-being. So we measure, test,
compare, contrast; we assess, that is, the better to shape, fashion, and control our
futures and those of others.



A New Antipathy 

Having said that, we now can see more clearly why various assessment activities,
such as those of the Educational Testing Service and the College Board, have come
to be viewed with so much suspicion and disdain, however widely employed those
various tests and evaluations continue to be. The crucial suspicion is that the
assessments being performed will be utilized only, or primarily, to select out from
the existing human stock those cohorts who measure up to certain specifications
needed to preserve the going order of things. Put in the most invidious terms, the
assessment of educational functioning is seen by those who decry it most as an
instrument for the preservation of existing inequalities and the perpetuation of the
privileges of existing elites and their offspring. The evaluations are seen as devoid
of, or even antagonistic to, the idea that humans with proper effort can shape new
futures for old populations and create new populations with new endowments, new
capacities, new roles and functions.

It is a mark of the tempo of change in modern society that while educational
measurement experts feel they have barely begun to understand how the human
mind and its various faculties work, and why populations differ in these manifest
faculties, some segments of the lay public condemn the instruments of assessment
as though they were already full-blown instruments of class and racial bias and
malice. These are the laymen who feel most adversely affected by the ways in
which such evaluations have heretofore been utilized.

This new theme, this antipathy to assessment, especially among underprivileged
sociocultural groups and their champions, is in some ways more like the earlier
adoration of Divine omnipotence than the newer secular theme of comparison and
contrast. For underlying the antipathy to assessment is the partly religious notion
that all humans are equally entitled to an equal share of the good things of life,
regardless of the differences in their measured talents and abilities. All humans, in
short, are equal "children of God." Hence, all efforts to measure them, typologize
them, categorize them, and assign them different shares of the good things of life
are to be deplored. I have said that the underlying spirit is partly religious. But it is
also partly Marxist and partly humanist, and, in the final analysis, it can be shown
to be a direct outcome of certain philosophical and scientific works which also have
led to the development of psychometrics and sociometrics. I am thinking
particularly of the extraordinary way in which John Dewey, in Human Nature and
Conduct, combines a respect for the fundamental equality of all human beings with
an equal respect for the importance of rational inquiry and measurement of causes
and consequences in human affairs.

This leads me to assert that, all notions to the contrary notwithstanding, there is
no inherent moral quality to assessment per se. It is a technological instrument. As
such, it is value-neutral and non-directional. It can be and has been utilized for the
most diverse purposes. But its availability and the state of its development do not
determine its usages or direct human affairs. It is like all other technological
products, like all other machines, if you will. Humans decide why they want
machines, what kinds of machines they want, and how they will use them. If a
machine seems to be operating per manus Dei instead of per manus humanis, the
single act needed to bring the machine under control is simply to pull out the plug.

But that is only a temporary measure, and one born out of desperation. The fact
that humans have great power to put assessment and other technological devices to
the uses they themselves choose to serve suggests that some approach other than
pulling out the plug would be desirable. If it is humans who finally determine how
technology will be used, and it is, then it is humanly arranged social power that
determines who will determine the uses of technology.

Assessment, in sum, has come to serve two enormously important functions,
namely, to enable humans to manage diversity through reduction and ordering, and,
simultaneously, to expand human existence and give it new and richer meanings by
creating new ranges of options and choices, which can be made in more informed
and rational ways because of the enlightenment that assessment provides. How fully
each of these two major functions of assessment will be implemented is, finally, a
human choice.






A Look at Education Reporting 

By Fred M. Hechinger



A visitor from another planet would, had he happened to drop in on some recent
Presidential press conferences, be forced to conclude that the American sector of
Planet Earth would be devoid of problems if only the news media stopped creating
them. Those anti-Nixon intellectuals (educators included) who react to the charges
against the news media with anger, however, are not entirely honest if they imply
that the President has invented these defensive tactics. As any veteran education
writer can attest, educators are past masters of the technique.

For example, during a recent meeting with New York City school administrators,
a principal launched a spirited attack against The New York Times' practice of
publishing the reading test scores for all the system's schools. The scores, he said,
lead the uninformed public to draw all kinds of erroneous conclusions about the
quality of the schools. "Besides," he added, "the tests are no damned good."

Having been the villain who originally urged, authorized, and assigned the
undercover operation that broke the secrecy that used to keep the scores hidden from
public view, I had to rise to the defense of The Times. I suggested that publication
of the information could hardly leave the public in greater ignorance than it had
been without the information. I conceded that it is the newspaper's responsibility
to accompany the scores with all the available explanatory and interpretive
information. Such efforts, I suggested, had been made honestly and diligently, but
if specific omissions could be brought to our attention, they ought to be corrected.
Finally, I asked: "If the tests are, in fact, 'no damned good,' then why are they still
being used?" (Except for the fact that we had repeatedly reported the experts'
doubts about the tests in question, the principal might have triggered another kind
of story which would have pleased him even less than the original one.)

The point I am trying to make is that educators, who tend to be politically very
insistent on the people's right to know, are often not very consistent in applying
this principle to their own activities. Just as Mr. Nixon has repeatedly complained
that the media fail to report what is right with America, educators regularly charge
that the media are "wallowing," if I may coin a phrase, in what is wrong with
education. Many who cheered the disclosure of the Pentagon Papers nevertheless
denounced the disclosure of the reading scores, and that's only a random example.

I am afraid there is no answer that will satisfy those who ask: Why is bad news
given so much more space than good news? Good news is given quite a bit of space
provided it is newsworthy and not merely routine. Few educators who are also
commuters would hail a newspaper for commenting favorably on the fact that
many trains do arrive on time.

The Role of the Media

Now for a few specific observations concerning what I consider to be the role and
responsibility of the media in educational reporting.

My first answer to that question is simply reporting. The media ought to report
what is happening (and, of course, what is not happening). They ought to tell what
the establishment or the profession does, and where it succeeds or fails. They ought
to report what the non- or anti-establishment forces are doing, planning, criticizing,
and proposing.

To accomplish the foremost task, reporting, the media ought to make available
time and space in far more generous proportion to the rest of the news than is now
the case. I am not saying this in a Utopian, pie-in-the-sky manner, without full
consideration of media economics; I am merely asking for a more fitting allocation
of time and space within the total scheme of coverage. (If advice-to-the-lovelorn
columns are an indispensable public service, why not advice-on-education columns,
too? The case load of the education-lorn may, in fact, be greater than that of the
lovelorn.)

A second vital function, closely related to reporting and crucial to successful
reporting, is translating. Just as a science story is useless if it cannot be understood
by the lay reader, so is an education report meaningless if it is comprehensible only
to the profession. I won't bore you with a recitation of my 20-year struggle against
professional jargon; I merely suggest that not enough members of the education
profession will ever be taught comprehensible English to eliminate the need for
education reporters who also are skilled translators. I realize that the translation
often offends and even infuriates the professional "source" of a story; the cause for
such displeasure is sometimes a faulty translation but more often the fact that the
story, in translation, has lost its intended cover-up of the truth. That is precisely
why good reporting and translation must go hand-in-hand.

The third crucial requirement is interpretation. It is the area in which the media
are probably most vulnerable, partly because it requires space but more importantly
because it requires qualified interpreters. (Let me come back to the latter
requirement a little later.) What is needed is a constant effort to place education news in
historical, national, and (even more important) international perspective. When I
am told that the schools are the most oppressive institutions, I want to know in
relation to what? To prisons? To corporations? To French schools? When I am
informed that American schools have failed to close the inequality gap, I want to
know how well or badly they have done on that score in comparison to British or
German or Indian schools. I want to get some insight into the schools' historic
mission: what was it? How well has it been carried out?

The final category in which the media have a crucial role to play is opinion. If I am
somewhat hesitant to say much about this function, it is largely because that's how I
earn my living now, and that, of course, means that things are just fine. Seriously,
though, I think quite the opposite is true. The editorial pages (and that is where
opinion ought to be confined) are more often the place where reactionary publishers and
superannuated editors parade their nostalgic memories about the good old days when
the schools turned out such paragons of virtue as themselves. The theme, more often
than not, is antipermissiveness and the extolling of the old-fashioned virtues and the
work ethic, a la Agnew, you know. Honesty is the best policy, for others.

Clearly, the "they don't make them like me anymore" line about education is a
useless form of opinion. The less said about it the better, particularly the less said
about it on editorial pages.




This leads me to an observation that covers all the categories: reporting,
translating, interpreting, and opinion. Specialized personnel is essential, but
unfortunately conspicuous by its absence. The trend is one of backsliding. The media are
using fewer specialists in education today than they did a decade ago.

Let me underscore that by specialists I do not mean former teachers or graduates
of teacher-training institutions or even people who have taken courses in
educational journalism. What I am asking for is people who have devoted some years to
education coverage, and who have done so by preference rather than in surrender to
orders from above. A lively interest in education, free of any bonds of allegiance to
any educational dogma and, even more important, to any educational organizations
or groups, is all I ask for.

What I also ask for is that the media keep such home-trained specialists on the
education beat unless they eventually see fit to promote them to desk editorships
or other exalted positions. I suggest that not as a lobbyist for the specialists but as a
means of getting education-conscious people into policy-making positions.

Unhappily, I must say that the course I am proposing is not currently being
followed: not by the newspapers (including my own, which keeps shifting people
into and out of the education news department before their experience has a
chance to pay off), not by the newsmagazines, and least of all by the television
networks, which, to the best of my knowledge, have no education specialists or
even consultants. The networks used to do some occasional, and even some very
fine, education documentaries, but these efforts appear to have been declining ever
since the various revolutions (student revolt, urban confrontation, and curricular
ferment) were called off for lack of money and interest.

That leads me to my final point: loss of interest. Education is currently in a deep
slump. Its outside support forces appear to have scattered. The Nixon
Administration has undoubtedly viewed this development with satisfaction and is not
about to rally the citizenry to the support of an enterprise that might cost money.
The schools and colleges are shell-shocked by recent unrest and current recession.
Few ideas appear to be coming out of the academic establishment. The period of
growth and expansion is over.

The media cannot, in fact must not, create action where there is no action. But
the media are responsible for the coverage of education, and the city editor's
argument that the public is no longer interested simply does not hold water.

Lack of progress and absence of planning and leadership are legitimate and
important areas for coverage in education, just as lack of progress and absence of
leadership are always considered legitimate and important areas for coverage in
politics and government. And, of course, instances of action and leadership amid
general depression and fatalism are that much more newsworthy.




Some Points of Interest



Following the speakers' presentations, a number of comments and questions were
posed by the respondents and editors. These led to lively discussions in which
several points of interest or concern were raised.

One speaker chalked a square on the board and rapidly divided it into quadrants.
He labeled the columns Educational Uses and Noneducational Uses, and the rows
Formal Educational Experiences and Nonformal Educational Experiences. Then he
asked ETS staff members in the audience to rank the quadrants in terms of the
current state of measurement science. In other words, he was asking: Is it possible
to measure an individual's formal learning equally well for educational and
noneducational purposes? Can informal learning be measured as precisely as formal
learning?

One response: The greatest expertise is in measuring formal learning for
educational uses, the least in measuring informal learning for noneducational uses; the
relative expertise in the other two quadrants is debatable. "It's easier to agree on
what makes a good law student," this staff member observed, "than on what makes
a good lawyer."

After the session, another staff member dismissed the speaker's questions as an
"academic exercise." While it certainly was that, it nevertheless stirred the interest
of several editors and writers, and as the conference unfolded, the reason became
obvious. There is widespread uncertainty about the state of the science. How
precise are test scores? How accurate are they in predicting academic and
occupational performance? How relevant are they to the purposes (admission to
college, assessment of schools and colleges, hiring, professional certification and
occupational licensing) for which they are used?

These questions were raised directly or indirectly, and surfaced throughout the 
conference. So did some of the answers. Measurement is reasonably exact in the
aggregate, but precision cannot be guaranteed in an individual case. Reputable tests
measure well what they are designed to measure, but that is not always what the 
public thinks they measure. As for the relevance of measures to purposes, it 
depends on the proper use of tests. For example, if a test unrelated to job 
requirements is used to screen potential employees, that test is being misused. 

The issue of fairness in testing emerged as a prime concern of conference
participants. Their doubts stemmed in part from popular protests that tests are culturally
biased against low-income and minority groups and do not predict on-the-job or
in-the-classroom performance accurately for members of those groups.

It would not be accurate to say that consensus was reached on these issues. There
have been documented cases of test use for the purpose of discriminating against
minorities, but they usually demonstrate that the tests were misused, not that they
were planned instruments of oppression. The preponderance of available evidence
indicates that, while minority group members and the poor do not score as high on
the average as majority group members and the more affluent, test scores predict
individual performance about equally well for all.

One of the few heated exchanges of the conference was related to the issue of
cultural bias. One speaker asserted that tests are value-neutral. In the subsequent
discussion, a writer countered that tests reflect the values of the colleges doing the
admitting or the industries doing the hiring. The values of these institutions in turn
reflect the values of society in general. So the tests are value-laden, not
value-neutral. But the speaker stood his ground: Tests measure the individual's knowledge
or skill in a specified field at a particular time. Values are applied only by those
who interpret and use test scores.

Subsequent informal conversations among participants indicated that the impasse
was a matter of semantics. Tests are not value-free. They do reflect the values of
society and its institutions. As one ETS staff member said, a test that is free of
values, free of cultural influences, is not likely to measure anything of practical use.
On the other hand, tests are value-neutral in that they are passive instruments that
can be used for good or evil. The same test that bars someone from a school or a
job might be used as a guide to remedial education or additional training.

Another concern about fairness in testing apparently sprang from apprehension
that the increasing use of tests for occupational licensing and professional
certification and simply for hiring would close more doors than it opened for minorities
and the poor. A writer asked whether wider use of performance-based tests for
persons entering various fields would not spark protests from those being tested. He
said the tests were likely to be seen as barriers to economic opportunity just as
college admission tests have been considered barriers to educational opportunity.
To illustrate the point, he envisioned a young black man who graduates from a
teacher-training university and suddenly discovers there is a new hurdle to jump
before he can get his teaching certificate. In the past, a degree was all that was
needed. Now a single test can block his way. Won't he feel that it's unfair?

Answers came from two ETS staff members. One said the nature of the tests will
determine, to some extent, how much hostility they arouse. If a test is shrouded in
secrecy, and failure is final, an adverse reaction is inevitable. But performance-based
tests are being designed as diagnostic tools with feedback to the candidate so
weaknesses can be corrected through further study or experience. The other staff
member noted that, in any case, measurement agencies have an obligation to
demonstrate that their tests are valid and to insure that they are used in valid ways.

One journalist wanted to know who determines or should determine the criteria
for measurement in a given case: Schools and colleges? The federal government?
Occupational or professional practitioners? And what role does the measurement
agency play? For instance, if questionable criteria are set forth, does ETS accept or
challenge them? The answer was "any of the above" might contract with ETS or
another organization to develop a testing, assessment, or evaluation program, and
whoever initiates the program retains ultimate control of the criteria. But ETS
certainly has, and uses, opportunities to influence its clients' decisions, and can
refuse to develop a program it believes is invalid. A staff member added that
occasionally ETS gets a chance to advise policy-makers generally, outside the
framework of specific program development. The U.S. Department of Labor, for
example, once asked ETS to examine how states and municipalities handle
occupational licensing and suggest improvements.




That exchange gives a hint of two recurrent themes that threaded through the
discussions. One was the responsibility of measurement agencies in general, and
ETS in particular, to various publics. This concern first arose in an editor's question
relating to the favorable ETS evaluation of Sesame Street for Children's Television
Workshop (CTW). That study, the editor said, was criticized by some on the
grounds that the wrong questions were asked, that CTW set the boundaries of the
inquiry and ETS simply worked within them. Was that true? And if so, where does
ETS' responsibility lie? Although he didn't say so specifically, he seemed to be
asking whether ETS did or did not feel an overriding obligation to serve the
interests of the viewing public.

The issue is complex, and so was the answer. ETS is obviously obligated to meet
its client's requirements. Yet, beyond that, ETS does have, and recognize, a
responsibility to the public interest. When the two are in conflict, ETS must try to
resolve the differences. It must use reason and expertise to influence the client. In
the case of Sesame Street, there was no such conflict. What CTW wanted was
legitimate: an evaluation of Sesame Street in terms of whether it was meeting the
goals set for it by CTW. ETS was not specifically asked to evaluate the goals to see
whether they were "the right ones" or examine cost benefits. As performed, the
evaluation was legitimate and honest, and it certainly was not inimical to the public
interest.

Echoes of this question, pinpointing a measurement agency's relative
responsibility in the face of competing interests, sounded frequently and related to testing
programs as well as research and evaluation. The majority of test-takers are
students, most of them seeking admission to colleges, graduate schools, and
professional schools. It was logical, therefore, that the related theme manifested at this
conference was the responsibility of measurement agencies to students.

A key question, repeated in various ways, was asked by one editor: Is it ETS
policy to provide students with full information about test scores and their proper
interpretation and how colleges use them? The answer was yes, to the extent
possible. Test-takers receive much information from ETS and the program sponsor
about their own scores and how to interpret them. Details about how colleges and
universities interpret and use test scores in making admissions decisions vary
considerably from one institution to another. Thus each institution is responsible
for informing students about its own procedures. In general, both the sponsors and
ETS want to protect the students' interests and provide them with all the facts
possible. One staff member noted that much more information is available to
students taking the College Board tests today than was available to the students of
years ago.

Two suggestions for change came from the participants. An editor asked if ETS
might shift gears, in light of higher education's metamorphosis from a seller's to a
buyer's market, and "test" colleges in order to provide a sort of shopper's guide for
potential students. He acknowledged that ETS was not likely to find an
organization to sponsor such an activity, but his question was serious. In answer, a staff
member called attention to one such attempt, the College Locater Service
sponsored by the College Board and operated by ETS. Through this service,
students submitted forms indicating the kind of institution they wanted to attend,
and ETS matched their descriptions with data on file, then forwarded the
information to them.

The other suggestion came from one of the respondents, a graduate student, who
said testing agencies and students need to "come together" for their mutual benefit.
Rather than a new service for students, she proposed a continuing interchange
between agency personnel and students. It would help students understand testing
and the uses of test scores, and it would help the agencies evaluate their programs.
She advocated three types of activity: 1) Testing agencies should send exhibits and
emissaries to the campuses; 2) they should hold conferences for representative
students who would relay what they learned to their fellow students; and
3) students should be included in the agencies' decision-making and evaluation
processes through appointment to trustee and advisory boards, and to research
internships.

Another topic that figured prominently in the discussions was the evaluation of 
educational programs and institutions. One writer asked: If resources are too
limited for comprehensive evaluation of a school system, is some better than none?
The answer was yes, if the variables to be studied are carefully selected on the basis
of their importance as well as the cost of measuring them. If cost alone is the
determining factor, the school system may end up with mountains of trivial data. A
subsequent question was: Would a "systems approach" to evaluation make usable
any of the useless information now available in abundance in some schools? The
response was yes, some of it might be rendered useful, but provision must be made
to incorporate it into a continuing evaluation plan.

What can be done, an editor asked, about the resistance (from classroom teachers,
for instance) to evaluation that seems to threaten the status quo? The answer: If
evaluation is needed for sound educational reasons, and it is, then the resistance
must be overcome. If the goal is to improve learning and good teachers feel
threatened by the proposed process of evaluation, the process might be modified. 
Conversely, the good teachers, if they understand the purpose, may be persuaded to 
see the process differently. 

Finally, conference participants paid considerable attention to new kinds of tests
to meet the needs of a changing society. Trends and prospects have been examined
in some of the papers in this volume, and there is no need to review them here. But
a pertinent observation, expressed during a discussion session, does merit reporting.
It was that, in the future, the techniques of measurement will be applied not only
to aptitudes and achievement levels but to more subtle indicators of personal and
intellectual functioning. It will become possible, in other words, to measure not
only "what makes a good law student" but "what makes a good lawyer."







ROSTER OF PARTICIPANTS 



Speakers 

Samuel Ball, Research Psychologist, ETS

K. Patricia Cross, Senior Research Psychologist, ETS

Fred W. Hechinger, Editorial Board, New York Times

Winton H. Manning, Vice President, ETS

Samuel J. Messick, Vice President, ETS

Melvin M. Tumin, Professor of Anthropology and Sociology, Princeton University

William W. Turnbull, President, ETS

Respondents

Myron Corman, Department of Urban Education, University College, Rutgers 
University 

Carl Crawford, Superintendent, Elementary District, Flemington-Raritan Regional 
Public Schools 

Jerrold K. Footlick, Education Editor, Newsweek 

Robert Glaser, Co-Director, Learning Research and Development Center, University 
of Pittsburgh 

George Harris, Program Officer, Southern Regional Council, Inc.

Marsha Levy, Graduate Student, Yale University

Ruth Mancuso, Chairman, Committee on Evaluation, New Jersey State Board of
Education 

Carmen C. Scott, Assistant Program Director, ETS 

Constance Shally, Content Specialist, Foreign Language Department, Hightstown 
High School 

Session Chairmen 

Richard S. Levine, Vice President, ETS

Robert J. Solomon, Executive Vice President, ETS 

John Summerskill, Vice President, ETS 

E. Belvin Williams, Vice President, ETS




Theodore Bleecker, Editor-in-Chief, The New York Teacher

Jenne K. Britell, Director, Information Services, ETS

Mary Chcuse, Assistant College Editor, Glamour

John A. Crowl, Managing Editor, The Chronicle of Higher Education

Walter A. Graves, Executive Editor, Today's Education

Anne S. Hopkins, Education Writer, Time

Gene I. Maeroff, Education Writer, New York Times

Loretta Hunt Marion, Editor, Scholastic Teacher 

Carole Martin, Education Editor, Associated Press 

Ted Miller, Associate Editor, Changing Times 

John P. Smith, Co-Manager, Media Relations, ETS

Midge Turk, College Editor, Glamour 

Melinda Upp, Assistant to the Director for Public Information, National Institute of
Education

Jim Warren, Editor, APA Monitor

Heather Weiss, Editorial Board, Harvard Educational Review

Richard Yurko, News Editor, The Dartmouth

Conference Chairmen 

Scarvia Anderson, Vice President and Director of the Atlanta Office, ETS

Jane Wirsig, Executive Director, Information Services and Publications, ETS

Conference Staff 

Vivian E. Brandle, Administrative Assistant to Ms. Wirsig, ETS

Gail Carlin, Secretary to Ms. Britell, ETS

Loretta C. Es2tari, Secretary, Information Services, ETS




